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ABSTRACT 


A series  of  linear  programming  models  are  used  to  clarify 
and  extend  a measure  of  efficiency  introduced  by  M.  J.  Farrell.  The 
duals  to  these  models  are  shown  to  yield  estimates  of  production 
coefficients  from  the  same  empirical  data  and  computations  that  yield 
the  measures  of  efficiency.  The  nature  of  the  resulting  production 
functions  and  ways  in  which  they  differ  from  more  customary  ones  are 
discussed  en  route  to  synthesizing  the  associated  cost  functions  and 
other  such  (economic)  relations.  Methods  for  adjusting  observations 
are  suggested  for  economic  inferences  and  policy  applications. 

Multiple  output-multiple  input  extensions  are  effected  via  a new 
definition  of  efficiency  which  involves  a nonlinear  model  for 
determining  the  optimal  input  and  output  weights  from  observational 
data.  The  theory  of  fractional  programming  is  used  to  secure  ordinary 
linear  programming  models  from  which  the  weights  and  efficiency 
measures  may  also  be  obtained. 
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I.  Introduction 

Economic  studies  of  resource  allocation  and  related  decision-making 
activities  of  governmental  agencies  and  programs  can  encounter  difficulty 
because  of  the  form  in  which  the  underlying  economic  constructs  are 
stated.  Oriented  mainly  toward  private  markets  and  entities,  the  constructs 
from  economic  theory  usually  take  the  form  of  extremal  relations.  The  concept 
of  a production  function  which  we  shall  be  utilizing  here  is  a case  in  point. 

This  function  assumes  the  form  of  an  extremal  relation  from  the  property  that 
output  is  always  maximized  from  any  combination  of  Inputs  that  may  be  utilized. 

When  perfect  competition  prevails  in  relevant  private  sector 
markets,  then  considerations  like  the  following  may  be  relied  upon:  (1)  All 
firms  in  the  same  market  have  approximately  the  same  efficiency  as  the  most 
efficient  among  them  and  (2)  the  latter  firms,  i.e.,  the  relatively  most 
efficient,  are  either  at  or  near  the  maximum  efficiency  that  the  existing 
state  of  technology  admits.  The  first,  which  may  be  referred  to  as  relative 
efficiency,  is  the  one  we  shall  focus  upon.  The  second  almost  inevitably 
carries  with  it  such  properties  as  freedom  of  exit  and  entry  in  pursuit  of 
maximum  profitability,  etc.,  and  hence  it  also  carries  with  it  possible  flows 
from  one  "industry"  to  another,  which  are  not  characteristic  of  governmental  (and 
like)  activities  which  form  the  central  concern  for  the  formulations  in  this  paper. 

This  paper  is  directed  toward  approaches  to  relative  efficiency, 
e.g. , for  governmental  activities,  by  means  of  which  we  can  either 

(a)  evaluate  the  efficiency  of  various  decision-making  entities  participating 

» *» 

in  like  programs  or  (b)  adjust  and  allow  for  efficiency  variations  among 

these  entities  in  order  to  bring  the  thus  adjusted  observations  into  conformity 


with 


underlying  concepts  involving  extremal  relations. 
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The  latter  topic  leads  to  a technique  of  data  analysis  which  we 
shall  refer  to  as  Data  Envelopment  Analysis,*  but  which  we  must  reserve  for 
separate  treatment  in  order  to  deal  adequately  with  the  topics  of  the  present 
paper.  Hence  we  shall  here  only  adumbrate  this  DEA  approach. 

This  is  to  say  that  our  present  paper  will  be  mainly  of  a mathematical- 
economics  variety  which,  for  the  most  part,  leaves  aside  the  further  topics 
of  statistical-econometric  data  treatment.  This  is  a natural  way  of  proceeding, 
we  might  remark,  since  our  DEA  approach  posits  a prior  adjustment  of  the  data 
to  bring  them  into  conformance  with  the  underlying  extremal  relations  required 
by  the  constructs  in  economics  before  undertaking  the  related  statistical 
hypotheses  tests  or  estimates.  Our  present  paper  is  also  of  value  for 
empirical  use  in  its  own  right,  too,  since  it  supplies  new  methods  of 
effecting  estimates  of  these  extremal  relations  via  mathematical  (as 
distinguished  from  statistical)  means  which  can  be  useful  in  a variety 
of  contexts.  For  example,  the  methods  in  the  paper  may  be  used  when  one 
is  satisfied  that  the  data,  (1)  are  not  subject  to  "large"  amounts  of 
error,  and  (2)  cover  the  relevant  entities  so  that  sampling  of  a statistical 
variety,  say,  is  not  a source  of  data  variation. 

That  is,  like  in  engineering  and  other  sciences, we  are  supposing 
2 

that  the  paradigms  of  statistical  estimation  and  testing  need  not  be  used 
when  such  errors  are  small  relative  to  other  considerations.  In  a similar 
vein,  one  further  justification  for  our  proposed  approach  arises  from  the 
fact  that  customary  statistical  econometric  techniques  based  on  "averaging," 

*See  Rhodes  [30 ] . 

2 

The  term  paradigm  is  used  in  the  sense  of  Kuhn  [26 ] . 


and  like  "central  tendency"  relations,  are  not  wholly  suited  to  the  task 
of  estimating  extremal  relations.^"  The  estimating  methods  we  are 
suggesting,  on  the  other  hand,  are  based  on  the  duality  relations  of 
mathematical  programming  and  hence  on  the  extremal  mathematics  which  underlies 
these  relations.  As  such  they  are  well  suited  to  the  estimation  of  extremal 
relations  from  empirical  data  provided  we  can  reorient  these  mathematical 
programming  constructs  from  their  usual  pre-decision  uses  as  planning  guides 
and  thereby  obtain  a complementary  use  by  applying  these  same  methods  and 

theorems  to  post-decision  (empirical)  data.  Such,  of  course,  is  the  main  thrust 

of  the  present  paper  in  which,  we  also  note,  we  restrict  ourselves  to  inequality 

systems  so  that  we  may  utilize  ordinal  as  well  as  cardinal  arrays  of  data. 

Having  now  highlighted  some  of  our  objectives  in  the  present  paper, 

we  should  pause  to  observe  that  empirical  approaches  to  the  study 

of  production  functions  have  not  always  relied  on  the  extremal  formulations 

of  economic  theory.  Thus,  R.  A.  Frisch,  in  what  is  probably  the  earliest 

study  of  production  functions  at  the  individual  plant  level t recognized  the 

difficulty  involved  in  his  use  of  a least-squares  regression  approach  for 

2 * 

estimating  such  extremal  relations.  In  particular,  he  explicitly  noted  the 
need  for  a production  function  formulation  based  on  the  "averaging"  that  such 
statistical  estimating  procedures  imply,  at  least  as  a matter  of  practical 

application. 

For  purposes  such  as  statistical  prediction  of  actual  behavior,  etc., 
this  kind  of  "practical  empiricist"  approach  can  have  advantages.  It  should 
be  recognized,  however,  that  a reliance  on  such  "averaging"  methods  produces 


See  the  discussions  in  [1],  [2]  and  [3]. 

2 

An  excellent  discussion  of  Frisch's  concept  as  well  as 
empirical  uses  production  function  studies  may  be  found 

other  approaches  to 
in  [20] . 

1 

A 

estimates  and  relations  which  do  not  then  have  direct  access  to  the  large 
body  of  theorems  and  methods  which  are  available  from  standard  versions  of 
micro-economic  theory.^  This  is  to  say  that  departures  from  the  defined 
extremal  relations  on  which  that  theory  rests  require  either  separate  and 

explicit  justifications  to  utilize  the  related  "marginalist"  conventions 

/ 

and  theories  or  else  entirely  new  formulations  must  be  essayed  before 

anything  like  these  conventions  can  be  utilized. 

In  many  ways  the  latter  course  is  the  one  that  will  be  followed 

in  this  paper.  Although  the  production  function  concept  that  we  shall  be 

employing  is  not  wholly  new,  having  been  employed,  e.g.,  by  M.  Farrell  [18] 

2 

and  others,  it  does  differ  from  the  ones  that  are  customarily  employed.  We 
also  alter  and  extend  even  the  production  functions  that  Farrell  et.al. 
utilized  and  we  also  supply  new  methods  of  estimation  and  interpretations. 

En  route  to  our  fully  developed  formulation 
we  6hall  try  to  stay  fairly  close  to  standard  micro  economic 
concepts.  The  point  to  be  made  for  the  present,  however,  is  that  our 
production  functions  will  be  based  on  optimizing  models  ano  methods.  This 
means  that  we  shall  feel  free  to  utilize  the  results  of  such  optimizations  to 
characterize  possible  economizing  in  resources  associated  with  inputs  and/or 
augmentations  associated  with  outputs.  We  shall  also  utilize  standard  economic 
constructs  such  as  marginal  productivities,  etc.,  and  indeed  we  shall  also 
utilize  still  other  optimizations  to  derive  still  other  economic  relations 
such  as  the  economic  opportunity  cost  functions  associated  with  our  production 
functions. 

All  of  the  above  will  be  formulated  within  the  characterization  we 

*In  many  of  the  current  energy  studies  the  failure  to  make  such  adjustments 

seems  especially  in  need  of  explicit  justification  if  only  because  of  the 

likely  presence  or  "waste"  (i.e.,  failure  to  achieve  efficiency  frontiers)  in 

the  presence  of  a large  and  rapid  price  change  for  a previously  greatly  "underpricec 

factor  of  production  like  energy.  See  [15]. 

2 

Actually  neither  Farrell  nor  his  followers  ever  developed  or  employed  these 
production  function  concepts  per  se. 
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gave  for  relative  efficiency  in  our  opening  two  paragraphs.  This  means 

that  we  will  not  be  concerned  with  best  alternative  uses  (e.g.,  in  some 

other  industry)  but  only  with  the  release  of  resources,  say,  for  which 

aome  other  use  is  available.  This  will  also  all  be  accomplished  within 

the  limits  of  specified  models  and  methods  — which  will  become  clearer 

as  we  proceed.  In  any  case  our  developments  will  be  formulated  around 

"decision-making  entities"  that  we  shall  refer  to  as  "firms."  * Such 

"firms"  will  then  be  said  to  constitute  an  "industry"  or  to  be  members  of 

a "program"  when  they  have  all  of  their  inputs  and  outputs  in  common. 

With  (1)  all  inputs  and  outputs  in  common  at  non-zero  values  we  can 

obtain  certain  simplifications  in  dealing  with  issues  like  scaling  to 

2 

common  units,  etc.,  and  (2)  utilizing  observations  from  individual  firms 
permits  us  to  avoid  troublesome  issues  of  aggregation,  etc.  Together 
these  two  approaches  permit  us  to  obtain  added  flexibility  for  synthesizing 
a variety  of  production  functions  which  may  then  be  aggregated  or 
disaggregated  to  varying  degrees  in  accordance  with  certain  general 
prescriptions  that  we  shall  supply. 


These  may  also  be  different  decision-making  entities  within  a single 
agency  such  as,  e.g.,  the  activities  under  a single  government  department 
or  program  director. 

2 

See  the  appendix  to  this  article. 
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Finally,  we  shall  make  explicit  contact  with  ongoing  bodies  of 

research  in  the  following  manner.  First,  we  shall  explicitly  relate  our 

developments  to  earlier  work  by  M.J.  Farrell  and,  indeed,  the  first  part 

1 

of  our  development  will  rest  on  this  work.  After  we  have  eliminated  or 

2 

modified  some  of  Farrell's  concepts,  however,  we  shall  undertake  certain 
new  extensions.  In  particular  we  shall  provide  a linear  programming  formula- 
tion which  via  its  dual  automatically  supplies  the  wanted  production  coefficient 

estimates  that  Farrell  (and  others)  could  not  supply  without  a good  deal  of 
3 

extra  effort.  That  is,  this  effort  had  to  be  undertaken  in  addition  to  the 

work  required  to  secure  the  wanted  estimates  of  decision-making  efficiencies 

whereas  our  one  linear  programming  approach  secures  both  sets  of  estimates 

4 

with  the  same  computations. 


1See  [12). 

2 

E.g.,  Farrell's  use  of  "points  at  infinity"  to  complete  his  analysis,  as 
discussed  in  [13], 

3 

F^rsund  [20]  specifically  noi.es  the  fact  that  Farrell  fails  to  provide  what 
is  needed  for  a completely  satisfactory  characterization  in  these  respects. 

4 

Our  model  is  also  formulated  in  a way  that  provides  easy  access  to  standard 
linear  programming  codes.  In  this  connection,  however,  we  need  to  note  that 
Alan  Hoffman  suggested  a linear  programming  formulation  at  a very  early  date. 

See  pp.  284-285  in  [18].  In  fact,  the  formulation  which  Hoffman  presented  as 
a discussant  in  [18],  along  with  his  suggestion  for  the  use  of  Lemke's  [29] 

"dual  method"  was  subsequently  employed  with  considerable  gain  in  computing 
efficiency  by  Farrell  and  Fieldhouse  in  [19].  Although  we,  too,  shall  employ 
Lemke's  "Dual  Method"  as  well  as  Dantzig's  [16]  Simplex  Method  of  solution  - 
see  Chapter  XI  in  [9]  for  a combined  geometric  interpretation  of  both  the 
Dual  and  Simplex  Methods  - we  need  to  observe  that  our  formulation  differs 
from  Hoffman's.  His  formulation,  which  was  restricted  to  the  single  output 
case,  is  (a)  presented  in  equality  form  and  he  does  not  deduce  our  inequality 
form  and  (b)  carries  with  it  such  concepts  as  "points  at  infinity"  by  approxi- 
mation with  "large"  real  multiples  of  unit  sectors.  Moreover  the  uses  of  duality 
with  related  interpretations  did  not  enter  into  Hoffman's  discussion  in  any  way. 


With  the  above  tasks  completed  we  shall  then  undertake 

to  make  explicit  contact  with  the  Shephard-Samuelson^  duality 
2 

theory.  This  will  make  available  yet  another  body  of  developments  in 
economics  by  means  of  which  one  can  move  from  production  functions  to  cost 
functions  and  related  pricing  and  costing  theories  as  well.  Here  we  shall 
only  carry  the  analysis  far  enough  to  make  the  indicated  contact  although 
we  have  elsewhere  shown  [15]  that  Shephard's  "gauge  function"  is  the  complement 
of  Farrell's  efficiency  measure  — and  we  have  indicated  other  relations  as  well 
so  that,  e.g.,  an  extension  of  "Shephard's  lemma"  is  available  for 
the  kind  of  inequality  analysis  which  is  required  for  many  types  of  govern- 
mental activities. 

Such  governmental  activities  frequently  involve  multiple  outputs  and 
multiple  inputs  with  no  evident  weightings  readily  available  via  market 
prices,  say,  or  other  such  objectively  verifiable  magnitudes.  We 
nay  perhaps  make  the  point  at  issue  sufficiently  clear  by  means  of  educational 
studies.  If  one  wants  to  move  from  the  usual  scalarizations  such  as,  e.g., 
income-earning  ability  as  a single  output  from  all  educational  inputs,  then 
one  confronts  the  need  for  weighting  cognitive  outputs  such  as  "arithmetic 
ability"  relative  to  affective  outputs  such  as  "positive  attitude  toward 
the  community"  relative  to  outputs  of  psychomotor  skills  such  as 


See  Shephard  [36 ] and  [37 ] and  Samuelson  [32 ] . 

^We  have  elsewhere  argued  that  this  is  better  regarded  as  a version  of  transform 
theory  in  mathematics  not  only  to  avoid  confusion  with  the  duality  theory  of 
mathematical  programming  but  for  other  reasons  as  well.  See  [15]. 
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"ability  to  tread  water  in  a swimming  pool  and  turn  from  front  to  back  in 
a specified  time."  The  latter,  suitably  measured  as  an  index,  say,  of 
"bodily  control  and  appreciation"  can  be  regarded  as  an  output  which  has  "some 
value,"  but  that  value  is  not  easily  referred  to  any  market 
system  for  a more  or  less  objective  determination  of  its  relative  weight. 

The  same  applies  to  "affective  outputs"  of  the  kind  already  indicated  and  also, 
to  a lesser  extent,  to  "cognitive  outputs"  such  as  arithmetic  abilities. 

Similar  remarks  may  apply  to  the  inputs.  "Amount  of  teacher  time" 
may  be  referred  to  salaries  and  perhaps  further  adjusted  for  inter-school 
and  inter-program  comparisons  in  a seemingly  reasonable  manner.  But  great 
difficulties  may  be  encountered  in  dealing  with  inputs  such  as  "the  time  and 
efforts  of  community  leaders"  and/or  "parents,  and  others,"  which  are  evidently 
of  value,  but  which  have  no  ready  market  or  other  such  referent  for  securing 
suitable  sets  of  weights. 

Since  we  want  to  deal  with  these  multiple  output-multiple  input 
situations  directly — i.e.,  without  scalarizing  in  an  ex-cathedra  fashion — 
we  shall  proceed  as  follows.  First  we  shall  extend  the  customary  definition 
of  "efficiency"  as  a ratio  of  "wanted  outputs"  to  "valued  inputs"  to  formulate 
a new  nonlinear  programming  problem.  This  formulation  will  enable  us  to 
determine  the  associated  weights  objectively  and  in  an  optimal  manner  from 
observational  data.  That  is,  the  model  together  with  its  interpretation  will 
make  the  nature  of  the  optimization  evident.  The  weight  determination  will 
then  be  objectively  verifiable  by  anyone  who  applies  this  model  to  the  same 
body  of  data  and,  allowing  only  for  the  case  of  alternate  optima,  a different 
choice  of  weights  will  require  some  other  specified  model  in  any  case  where 
the  ratio  definition  of  efficiency  is  applied. 


The  above  model,  and  its  interpretation,  will  provide  conceptual 
clarity.  Because  the  resulting  problem  is  nonconvex  as  well  as  nonlinear, 
however,  some  additional  developments  will  be  needed  for  purposes  of  solution. 

Via  the  theory  of  fractional  programming,^  which  we  have  elsewhere  formulated, 
however,  we  shall  be  able  to  show  how  tc  transform  this  nonlinear  and  non- 
convex problem  into  an  ordinary  linear  programming  equivalent. 

This  transformation,  as  we  shall  see,  3lso  provides  a natural 

generalization  from  the  single  output  to  the  multiple  output  case.  The 

linear  programming  problem  resulting  from  this  fractional  programming 

2 

transformation  also  has  a dual.  This  means  that  all  of  the  preceding 
results  for  estimation  and  cost  function  relations,  etc.,  go  over  in  a 
readily  apparent  manner  to  this  more  general  case.  In  other  words,  our 
new  estimation  methods  for  extremal  relations  go  over  to  the  estimation 
of  multiple,  as  well  as  single,  output  relations. 

Because  these  ideas  involve  a rather  complex  array  of  considerations, 
we  have  elected  the  following  presentation  strategy.  Our  initial  development  will 
be  rather  leisurely  and  conducted  in  some  detail.  The  numerical  illustrations  as 
veil  as  the  mathematical  developments  and  interpretations  that  we  shall 
supply  are  intended  to  give  the  reader  some,  "feel"  for  what  is  involved. 

To  a considerable  extent  we  shall  also  try  to  relate  our  discussion  to 
familiar  developments  (like  isoquant  analysis)  ana  already  available  literature 


This  theory  is  complete  as  observed  in  [7]  and  [10],  so  that  no  difficulty 
need  be  encountered  because  of  zeros  in  the  denominators  of  the  indicated 
ratios — or  any  other  like  aberrant  behavior. 

2 

I.t.,  a linear  programming  dual  as  distinguished  from  the  dual  to  the 
corresponding  fractional  programming  problem.  See,  e.g.,  Jaganathan  [23] 
and  Schaible  [32].  Sec  also  Bector  [5]. 
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(such  as  Farrell).  This  done,  however,  we  shall  then  accelerate  our  pace 
with  only  brief  indications  of  possible  uses  for  our  more  general  developments. 
Then  we  shall  draw  all  of  the  preceding  together  and  try  to  point  up  possible 
further  paths  of  research. 

2.  Background  Illustration 

Farrell  in  (l8)  employed  three  concepts  of  efficiency:  (l) 

technical  efficiency,  (2)  price  efficiency,  and  (3)  overall  efficiency 

(price  and  technical  efficiency).  Like  Farrell,  and  others  who  have 

followed  him,  we  propose  to  restrict  ourselves  mainly  to  his  concept  of 

technical  efficiency.^"  Some  of  the  reasons  for  this  have  already  been 

indicated.  In  addition,  price  data  are  rather  slippery  and  elusive  in 

that  they  may  impound  a variety  of  motives  relative  to  past,  present  and 
2 

future  prospects  and  they  may  inject  certain  types  of  instability  as  these 

motives  and  their  associated  expectations  change. 

These  and  other  problems  associated  with  the  use  of  market 

3 U 

price  data  are  discussed  in  detail  by  Farrell  as  well  as  others.  Hence, 
we  may  dispense  with  further  consideration  of  "price"  and  "overall 
efficiency"  in  order  to  focus  our  own  attention  on  technical  efficiency 
only.  Here  we  need  assume  only  that  all  inputs  and  outputs  have  value. 

That  is,  these  inputs  and  outputs  are  not  explicitly  priced  or  costed 
but  they  all  have  value^  in  the  sense  that  technical  efficiency  is  not 
present  when  it  is  possible  to  increase  any  output  without  decreasing 
any  other  output  or  increasing  any  input.  Conversely,  technical 

^To  deal  with  problems  such  as  increasing  returns  to  scale,  Farrell  felt 
obliged  to  return  to  a use  of  "price"  data.  See  (19).  See  also  Seitz  (35). 

^That  is,  market  prices  reflecting  actual  transactions  as  distinguished 
from  the  "duality  price  relations"  which  are  really  the  same  as  optimal 
substitution  ratios. 

3See  (18)  and  (19). 

Vorsund  in  (20)  provides  a detailed  discussion. 

5This,  as  we  shall  see,  is  related  to  the  concepts  of  Pareto-Koopmans ' 
optimality  and  discussed  in  Chapter  IX  of  (9). 


efficiency  is  also  not  present  when  it  is  possible  to  decrease  any  input 
without  increasing  any  other  input  or  decreasing  any  output.  We  would 

like  to  arrange  our  model  and  methods  of  measurement  so  that  efficiency^ 
will  be  ascribed  to  the  observed  behavior  of  any  firm  only  if  neither  of 
the  preceding  is  true. 

Figure  1,  below,  provides  an  illustrative  hypothetical  example 
which  we  can  use  to  illustrate  Farrell's  approach  as  well  as  our  own. 

For  this  simple  illustration  we  proceed  as  follows.  First 
we  reserve  for  separate  discussion.  Then  we  interpret  P^,....P^  as 
points  in  a two-dimensional  Cartesian  space  in  which  the  coordinates 
represent  observed  amounts  of  two  different  factors  of  production  used  to 
produce  a single  product  by  five  different  "firms." 

We  are  here  concerned  with  the  one  output,  as  distinguished  from 
the  multiple  output,  case.  We  do  have  multiple  inputs  (here  two),  however, 
and,  given  these  observations,  the  first  problem  is  to  locate  the  effi- 
ciency frontiers  or  rather  the  subset  of  extreme  points  from  which  such  a 
frontier  may  be  formed  by  reference  to  the  five  points,  P^,  P^»  and 

P,.  as  shown.  These  "efficient"  points  are  then  joined  by  linear  segments, 
as  in  Figure  1,  to  form  the  corresponding  "efficient  isoquant"  which,  as 
can  be  seen,  is  a "piecewise  linear"  and  continuous  curve.  Here  in  this 
two-dimensional  space  the  efficient  surface  is  a line,  i.e.,  an  isoquant 
line,  but  in  m-dimensions,  "facets"  of  dimension  greater  than  one  will  be 

subtended  in  an  efficient  isoquant  surface 
o 

Actually  a slight  generalization  of  the  idea  of  Pareto-Koopman s'  optimality 
in  the  sense  of  [9]  Chapter  IX  is  involved  here  in  that  we  are  ascribing 
"value"  to  Inputs  as  well  as  outputs. 
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An  "efficient  isoquant"  in  this  analysis  represents  what  Farrell  regards 
*8  "practically  attainable  efficiency."  By  this  he  means  that  at  least  some 
"firms "attained  this  level  of  efficiency  even  though  others  did  not.  Then,  by 
interpolation  between  these  observed  points,  he  provides  a "facet"  (here  a 
line)  of  reference  for  measuring  the  relative  efficiency  of  the  remaining 
firms. 

To  develop  his  measure  of  efficiency,  Farrell  makes  two  key  assump- 
tions. The  first  is  that  any  firm  can  expand  or  contract  along  a "ray"  through  the 
origin  such  as  the  one  indicated  by  the  broken  line  from  the  origin  to  P^. 

The  other  assumption  is  that  all  convex  combinations  of  the  observed  points, 
Pj,...,p£  represent  actual  production  possibilities.  The  "efficient  points" 
then  correspond  to  the  extreme  points  of  this  convex  set  plus  the  union  of 
the  boundary  facets  of  this  convex  set,  i.e.,  the  union  of  the  boundary 
8lmplexes  formed  on  the  efficient  extreme  points.  The  latter  thus  consti- 
tutes the  "efficient  isoquant"  or  the  "efficient  facet"  - viz. , the  locus 
of  points  corresponding  to  the  minimum  inputs  of  all  factors  for  a specified 
level  (e.g.,  one  unit)  of  the  observed  output. 

To  see  what  is  involved  graphically  in  Farrell's  proposed  measure 
refer  to  Figure  1 where  the  "efficient  isoquant"  is  drawn  in  piecewise 
linear  segments  from  P,.  to  and  P^  to  P^.  These  points  constitute  the 
input  vectors,  per  unit  output,  for  firms  3,  4,  and  5 with  linear  interpola- 
tion specifying  what  should  be  attainable  for  P^  and  For  example,  let 
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t(OP2)  “ length  of  "ray  from  origin"  to  P 2 

(1)  and 

^(OPj)  ■ length  of  ray  from  origin  to  P^  which  is  the 

efficient  isoquant  point  "closest  to  the  origin" 
on  the  ray  to  P^. 

Then  Farrell's  measure  of  efficiency  or,  more  precisely,  his  measure  of 
"technical  efficiency"  is 

(2)  0 < TEF(P2)  = i(OPp/Jl(OP2)  <1. 

Note  that  if  P2  had  actually  occurred  its  measure  of  efficiency 
would  have  been  unity.  This  is  to  say  that  because  P2  is  a point  on  the  boun- 
dary line  segment  between  P^  and  P^  it  is  an  efficient  isoquant  point  and 
therefore  the  "closest  to  the  origin"  isoquant  point  on  the  ray  to  P2.  We 
note  that  since  P^  and  are  adjacent  efficient  extreme  points  of  the 
convex  set  of  production  possibilities,  P2  can  be  represented  as  a convex 


combination  of  these  points.  That  is,  we  would  have  ?2  = P^A^  + **4X4  with 

a!.  XI  > 0 and  A!  + X!  = 1.  Since,  in  fact,  P*  is  observed  above  the  line 
4 3 — 3 4 .2  

connecting  P^  and  P^,  its  efficiency  measure  is  less  than  unity,  which  we 

shall  subsequently  relate  to  the  expression  P2  = P^X^  + P^X^  with  A^,  A^  ^ 0 

and  Xg  + X^  > 1 . Moving  next  to  P2"  we  can  also  express  this  as 

P,"  ■ P.Ai  + P.A'  with  A',  A'  >0,  A'  + A!  < 1.  In  each  case  non-negativity 

2 3 3 4 4 3 4 — 3 4 

of  both  variables  is  preserved  with  the  resulting  sura  indicating  where  the 
point  is  relative  to  the  isoquant  segment  connecting  and  P^. 

Now  consider  non-negativity.  All  points  inside  the  cone  with  edges  in- 
dicated by  the  broken  line  from  the  origin  through  P^  and  P^,  respectively,  can  be 
expressed  as  nonnegative  combinations  of  these,  two  points.  Only  when  one  attempts 
express  a point  outside  the  cone,  e.g.,  a point  such  as  P^  , is  the  nonnegativity 
condition  violated.  But  then  a recourse  to  our  piecewise'  linearity  assumption 
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allows  us  to  generate  a new  isoquant  segment  connecting  and  P,.  from 
which  to  evaluate  the  relative  efficiency  of  P^t  and  so  on.  Clearly,  then, 
any  point  such  as  P^  or  P ^ will  not  be  efficient  because  there  exists  a 
convex  combination  of  other  points  which  has  a lower  value  for  at  least 
one  of  its  input  coordinates.  This  is  not  the  case,  however,  for  the 
other  points  P^,  P^,  P^  in  the  set  we  are  presently  considering  or  for 
any  point  lying  on  the  lines  generated  from  all  convex  combinations  of 
Pj,  P^  and  P^,  P^»  respectively. 

Thus,  relative  to  the  indicated  set  of  observations,  we  charac- 
terize only  the  latter  set  of  points  as  efficient.  The  others  are  not 
efficient  on  Farrell's  "ray"  and  "isoquant"  assumptions.  Bear  in  mind, 
however,  that  we  are  directing  our  attention  solely  to  relative  evalua- 
tions and  technical  efficiency  only. 

Having  achieved  our  wanted  clarity  in  only  qualitative  (effi- 

| 

ciency  vs.  inefficiency)  form,  however,  we  now  need  to  provide  a way  of 
meaningfully  measuring  the  "amount"  or  "degree  of  efficiency"  displayed 
by  each  of  our  observed  firms.  We  also  want  to  do  this  is  a way  that 


provides  a convenient  basis  for  subsequent  generalizations.  Therefore  we 
proceed  as  follows.  For  each  of  j = 1,  ...,  n firms  we  are  given  observations 
on  their  Inputs  for  each  of  two  factors  of  production  which  we  symbolize  as 


(3.1) 


x^j  = amount  of  first  factor  used  by  firm  j 


Xjj  = amount  of  second  factor  used  by  firm  j 
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ye  are  also  given  the  amount  of  output  --  sales,  shipments,  or  production  -* 


which  we  symbolize  as 

(3.2) 


«»  amount  of  output  of  firm  j 


In  each  case  we  assume  that  the  amounts  in  (3.1)  and  (3.2)  are  all 
positive  and  then  combine  them  for  ratio  representation,  as  in  Figure  1,  via 


(4)  xij  ‘ Vyj 

*2j  = 

In  other  words,  xj^ , are,  respectively,  the  inputs  per  unit  output 
utilized  by  firm  j *»  1,  . ..,  n.  The  points  in  Figure  1 are  then  represented  as 


Thus  the  observations  in  Figure  1 are  all  "normed"  by  reference  to  the  outputs 
for,  respectively,  each  of  the  j * 1,  ...,  5 firms  we  are  presently  considering. 
Accordingly,  the  "efficient  isoquant"  exhibited  there  represents  the  required  input 
rates  for  1 unit  of  output  and  so  we  also  refer  to  it  as  the  "unit  isoquant." 

We  can  now  conclude  this  section  by  further  formalizing  some  of  our 
preceding  discussion  as  follows.  Consider  any  two  points  and  Pg  which 
form  a basis  for  the  2-dimensional  space  exhibited  in  Figure  1.  That  is,  we 
assume  P^  and  P^,  which  are  2-component  vectors,  are  linearly  independent. 

Any  other  observed  point,  P^,  say,  may  be  expressed  in  terms  of  them  via 


(6.1) 


p„x:  + p x:  « p . 

r r s s o 


Now  assume  P and  P are  also  efficient.  If  X’,  X'  >0  then  P is  in  the  cone 
r s r’  s — o 

with  edges  formed  by  extending  a ray  from  the  origin  through  each  of  and  Pg, 
respectively.  Conversely,  if  P is  not  in  that  cone  then  at  least  one  of  the 
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variables  required  for  the  expression  of  P in  (6.1)  must  be  negative.  If 

o 

(6.2)  X\  X'  > 0 

r*  s — 

and 

(6.3)  X'  + X'  = 1 

r s 

then  Pq  is  on  the  line  segment  connecting  P^  and  Pg.  If,  on  the  other 

hand,  X'  + X’  > 1 then  P cannot  be  efficient  while  if  X'  + X'  < 1 then 
r s o r s 

P and  P cannot  both  be  efficient, 
r s 

In  any  case  since  we  are  considering  points  on  a ray  through  the 
origin,  their  coordinates  in  terms  of  any  basis  are  proportional.  Thus  we 
have 

(6.4)  0 < TEF(P  ) = 1/(X’  + X')  < 1 

— o r s — 

with  (6.2)  and  (6.3)  also  applying  whenever  P^,  Pg  and  Pq  are  all  efficient 

and  in  the  cone.  Alternatively,  if  only  condition  (6.2)  applies,  then  at 

least  one  of  P , P and  P is  not  efficient  while  if  (6.2)  does  not  apply 
r p o 

then  these  three  points  are  not  in  the  same  cone,  and  hence  cannot  be  used 
to  evaluate  each  other's  relative  (technical)  efficiency. 

Our  subsequent  linear  programming  formulations  and  developments 
are  undertaken  with  this  kind  of  piecewise  linear  development  in  mind. 
Hence,  we  pause  for  remarks  like  the  following.  Translated  into  economic 
terms  this  means  that  the  marginal  rates  of  substitution  are  piecewise 
constant  along  the  indicated  efficiency  frontier.  Moreover,  these  substi- 
tution constants,  which  vary  from  one  piecewise  segment  to  another,  do  not 
affect  the  marginal  productivities  of  any  other  pair  of  factors,  which 
are  also  piecewise  constant  (in  their  ratios)  along  their  own  respective 


frontiers. 


Now  we  should  observe  that  the  above  analysis  obviously  extends 
to  other  functional  representations  which  can  also  be  transformed  into 
suitable  piecewise  linear  format.  A case  in  point  would  be  a development 
via  piecewise  Cobb-Douglas  functions.  These  functions  being  representable 
as  linear  under  a logarithmic  transformation  were,  in  fact,  utilized  by 
Farrell  in  his  original  empirical  studies  [18].  These,  and  other  like 
possibilities  which  may  be  present,  will  also  be  preserved  in  the  develop- 
ment that  follows.*' 

On  the  other  hand,  we  will  not  discuss  cases  which  are  not  trans- 
formable to  such  piecewise  linear  equivalents.  Neither  will  we  discuss 

2 

other  cases  such  as  increasing  returns  to  scale,  etc.,  which  violate 
convexity  (even  after  suitable  transformations)  and  the  related  assumptions 
that  we  are  also  making.  Finally,  we  may  observe  that  Farrell's  ray 
assumption  imputes  the  property  of  constant  returns  to  scale  to  each  firm. 
Actually,  this  assumption  is  stronger  than  necessary  since  we  only  require 
that  an  intersection  be  effected  with  the  convex  isoquant  at  the  indicated 
point,  but  to  relax  this  ray  assumption  would  involve  introducing  additional 
complexities  into  the  analysis.  Hence  we  continue  with  Farrell's  ray 
assumption  while  observing  that  the  assumed  scale  constant  it  implies  will, 
in  general,  differ  for  every  firm. 


Actually  Farrell  did  not  use  piecewise  Cobb-Douglas  functions  but.as  Duffin, 
Peterson  and  Zener  [17]  pp.  265-267  show,  the  extension  involved  in  optimizing  a 
Cobb-Douglas  function  under  Cobb-Douglas  type  constraints  goes  over  into  an 
ordinary  linear  programming  problem.  See  also  Charnes,  Cooper  and  Kortanek  [11] 
linear  programming  problem.  See  also  Charnes,  Cooper  and  Kortanek  [11]  for 
more  general  functions  which  can  be  represented  in  infinite  systems  of 
linear  inequalities  under  suitable  transformations. 

2 

See  Farrell  and  Fieldhouse  [19]  and  Seitz  [35]. 
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is  assumed  to  be  a vector  of  ob.«  ervations  associated  with  one 

o 

of  j •*  1,  . ...  n firms  whose  inputs,  per  normed  unit  output,  have  been  observed. 
For  concreteness  let  Pq  = and  let  it  be  desired  to  determine  whether 

is  efficient.  To  do  this  we  form  the  linear  programming 


problem, 

(7.3) 


max  z 


with 


n 

E A! 

° j 


n 

i m;  • p.  i ° 

j-i  j 3 ° 1 

and,  to  insure  that  we  consider  P relative  to  a cone  on  efficient  extreme 

o 

points  which  contains  Pq,  we  employ  an  adjacent  extreme  point  algorithm 
such  as  the  "simplex"  or  "dual"  method  to  solve  this  linear  programming  problem. 
We  then  have  as  one  possible  solution 


(8) 


p.a: 

k k 


P.  = P 
k o 


with  A,'  ■ 1.  It  follows  that  Max  z * z*  > 1 and  therefore 
k o o — • 


n 


(9) 


0 < TF  (P  ) = 1/ (EA*' ) = 1/z*  < 1 
° j-l  J ° “ 


as  required. 

We  need  to  insure  that  the  efficient  isoquant,  when  attained, 
accommodates  all  possibilities.  For  instance,  we  need  to  insure  that  some 
firm  using  an  extremely  large  amount  of  one  factor  and  a very  small  amount 
of  every  other  factor  will  still  be  on  a ray  which  intersects  the  efficient 
isoquant  en  route  to  the  origin.  Farrell  adds  "points  at  infinity"  to 
insure  this  but  this  is  awkward  and  so  we  now  undertake  a development  which 

enables  us  to  move  ahead  without  recourse  to  these  constructs. 
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We  can  represent  the  points  at  infinity  os  » Me^,  i ■*  1,  . ..,  m, 

where  M is  a non-Archimcdean  transfinite  number  (M  > r,  r any  real  number)  }J 

and  the  e^  are  the  unit  coordinate  vectors  - see,  e.g.,  e^  and  in 

Figure  1.  The  other  points  P have  finite  and  non-negative  entries.  But 

now  consider  the  non-negative  representation  of  P , a finite,  non-negative 

o 

vector,  in  terms  of  the  P and  Q^,  - e.g., 

p0 " *y- + * 

j-i  3 2 i-i 


(10.1) 


n m 

- z Vi  + M £ 

j-i  J 1 i-i 


where 

Aj , § £ > 0 . 

Since  the  entries  in  the  P and  e^  are  finite  and  non-negative, 

such  a representation  is  possible  in  terms  of  finite  if  and  only  if 

the  § . are  of  the  form 
i 


(10.2) 


5i  " ^i 


where  the  are  finite  and  non-negative.  Thus,  we  get  the  equivalent 

finite  representation  of  the  constraints  as 
* 

n m 

(10.3)  E P.A!  + Z e.s,  = P 

j-i  -1  i i=i  1 1 ° 

Aj,  Si  > 0 . 

To  complete  this  part  of  the  analysis  we  now  explicitly  formulate 
the  linear  programming  problem  with  points  at  infinity  as  follows, 


1/  See,  e.g.,  Robinson  [31], 
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Max  z ■ E X'  + E § . 
° j«l  j i=l  1 


(11.1) 


with  P = E P \\  + E Q , e 
O j J i-i1*1 


X'y  \ > 0. 


Then  we  replace  this  with  the  following  new  form 


n ra 

Max  z = EX'+tjEs. 
° J-l  J Mi=l  1 


(11.2)  with  P = E ?\\  + E e.s, 

° j=l  j J i=l  1 1 

x'ysi>  o. 

where  we  also  note  that  the  constraints  can  be  written  in  our  newly  established 
equivalent  inequality  form  as 


P > E P X! 

° ”*  J-l  J J 

X'  > 0,  j « 1,  . . . , n. 


This  inequality  format  for  the  constraints  can  be  given  a geometric 
interpretation  in  terms  of  relative  efficiency  of  two  production  possibilities  - 
namely,  Pj  is  at  least  as  efficient  as  P^  if  Pj  < P^  . In  this  manner  the 
Farrell  efficient  (or  frontier)  points  are  simply  "vector"  or  "Pareto  minimal" 
points  of  the  set  of  all  production  possibilities.  This  means  that  Pj  is  not 
efficient  relative  to  Pj  if  Pj  contains  at  least  one  entry  strictly  less  chan 
the  corresponding  entry  in  P.. 
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A further  geometric  interpretation  of  the  set  of  production  possi- 
bilities can  also  be  made.  It  is  a convex  set  derived  from  the  convex  hull 
of  the  observed  points  by  adding  to  this  convex  hull  all  points  northeast  of 
it.  Thus,  if  Pr  is  a production  possibility  then  so  is  Pg  if  Pg  > 

The  non-Archimedean  programming  problem  (11.2)  provides  us  with  a 
z*  whose  reciprocal  is  the  Farrell  efficiency  measure  up  to  a (non- 
Archimedean)  infinitesimal.  If  any  of  the  s^  - which  may  be  considered  as 
slack  variables  for  the  representation  (12)  - is  in  at  positive  value  in  an 
optimal  solution,  then  Pq  cannot  be  efficient.  If  all  s^  are  at  zero  value, 

then  the  infinitesimal  part  of  z*  is  zero.  Thus,  for  our  new  computational 

o 

version  of  efficiency  we  can  replace  our  earlier  convexity  characterization 


with 


P^  *=  PQ  is  efficient  if  and  only  if 


(13)  i.  Its  optimal  z value  is  z*  = 1,  and 

o o 

ii.  The  slack  variables  are  all  at  zero  value  In 
every  optimum  tableau. 


for  the  problem 


Max  z 


n 

Z X’  . 

° i=l  J 


with 


(14.1) 


z p.x:  < p 

j 3 3 - o 


X.  >0 


or  its  equivalent 


Max  z = EX! 

° j-i  J 

with 


n m 

I P.X!  + I e s 

j=l  3 J i=l  J 


= P 


(14.2) 
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4.  Computational  Example 

We  Illustrate  the  above  developments  via  the  data  of  Figure  1.  To 
initiate  this  illustration  we  shall  continue  to  consider  only  the  points  P^, 

. ...  P,  and  reserve  P for  separate  consideration.  This  leads  to  the  tabular 

J 6 

arrangement  in  Table  1 where,  with  the  conventions  of  [9],  we  use  B and  to 
designate  basis  vectors  and  their  functional  coefficients  at  each  stage.  Thus, 
at  "Stage  0"  the  basis  B consists  of  the  slack  vectors  e^  and  e ^ with  functional 
coefficients  of  zero  shown  explicitly  on  the  left. 

(Insert  Table  1) 

Using  full  (rather  than  contracted)  tableau  arrangements  for 

effecting  our  simplex  calculations,  we  proceed  from  "Stage  0"  to  an  optimum 

at  "Stage  3?"  from  which  we  secure  the  following  information:  P^  and  P,. 

are  in  the  optimal  basis  and  hence  are  both  efficient.  Concomitantly, 

P = P,  has  z*  ■ 7/6  > 1 and  hence  is  not  efficient  - see  i in  (13)  - and,  in 
o 1 o 

fact, 

TEF(P  ■=  PJ  = 1/z*  - 6/7 
o 1 o 

measures  the  reduction  in  factor  inputs  which  could  have  put  this  firm  on 
the  unit  isoquant  if  it  had  been  producing  efficiently  - viz. , 


SIMPLEX  METHOD 


4 
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That  is,  the  operational  meaning  that  we  (like  Farrell)  accord  to  this  measure 

Is  that  a reduction  of  both  of  these  inputs  by  the  indicated  ratio 

would  have  resulted  in  th*»  same  output  if  this  firm  had  produced  efficiently. 

Thus  only  with  TEF(Po)  ■ 1 would  no  such  reduction  be  possible  so  that  the  firm 

associated  with  P could  be  characterized  as  efficient, 
o 

Since  P^  and  P^  are  efficient,  there  is  a possibility  that  the  basis 

of  Stage  3*  could  also  be  used  to  measure  the  efficiency  of  other  points  besides 

P^.  In  fact,  this  would  be  the  case  if  any  of  the  other  vectors,  and  P^, 

had  only  non-negative  entries  in  their  columns  at  this  tableau  stage,  i.e.  , 

at  Tableau  Stage  3*.  However,  this  is  not  the  case.  Both  ?2  and  P^  have 

negative  entries  in  their  columns  — which  means  that  neither  they,  nor  the 

rays  to  them  (from  the  origin) , lie  on  or  intersect  the  line  segment  connecting 

P^  and  P^.  In  other  words,  P^  and  lie  outside  the  cone  generated  from  the 

origin  through  Py  and  Pr  and  hence  cannot  be  expressed  as  non-negative  combina- 

o 

tions  of  these  two  points.  Cf.  Figure  1 and  the  discussion  in  Section  2. 

We  next  observe,  with  A.  J.  Hoffman  in  [18]  that  we  are  now  in  a 

position  to  use  C.  E.  Lemke's  "dual  method". This,  too,  is  illustrated  in 

Table  1 by  according  P_  the  status  of  a new  P and  transferring  the  data  for 

l o 

it  at  Stage  3*  to  the  new  tableau  represented  as  "Stage  A".  We  effect  this 

I transfer  only  for  clarity,  however,  since  we  could  also  have  proceeded  directly 

from  Stage  3*  without  any  tableau  such  as  Stage  4 to  achieve  the  new  optimum 
shown  at  Stage  5*.  This  is  all  done  via  the  dual  method,  as  indicated,  and 
hence  there  is  no  need  to  start  afresh  or  even  to  backtrack  to  earlier  tableaus. 

, -1 — 1 — - -- 

1/ 

- See  [29J.  For  further  treatment  of  the  dual  method  also  see  Chapter  XI of  [9]. 


L A 
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The  results  in  Stage  5*  show  that  and  are  efficient  and  that 

Pj  is  not  efficient.  Indeed,  P^  is  "exactly"  as  inefficient  as  P^.  Observe, 

however,  that  P^  was  referred  to  P^  and  P,.  while  P^  was  referred  to  P^  and 

P^,  so  that  the  efficient  referents  are  not  the  same.  The  term  "exactly"  must, 

of  course,  be  taken  with  a "grain  of  salt"  in  any  case,  since  there  is  some 

Toom  for  observational  error  which  would  presumably  be  evident  on  replication,* 

2 

and  this  is  a weakness  which  we  referred  to  in  our  introduction. 

On  the  other  hand,  the  fact  that  the  referents  generally  differ  for 
each  firm  which  is  to  be  evaluated  can  have  advantages  for  ce  tain  kinds  of 
applications.  In  evaluating  educational  programs,  for  Instance,  the  factor 
Inputs  utilized  by  different  "firms"  (e.g.,  schools  or  school  districts)  may  differ 
from  one  area  to  another  because  of  legal  or  other  requirements.  Hence  we  would  li 
to  have  the  basis  for  efficiency  evaluations  selected  from  among  the  efficient  vect 
which  are  as  much  alike  as  possible  to  the  vector  being  evaluated.  For  instance,  we 
would  prefer  to  have  P^  evaluated  with  P,.  and  P^  as  referents  and  P ^ evaluated 
with  P^  and  P^  as  referents  instead  of  having  P^  and  P ^ both  evaluated  with 
P^  and  P^  as  a basis.  Loosely  speaking,  our  model  (and  computational 
routines)  have  this  property  and,  in  addition,  the  vectors  in  the  basis  which 
are  closest  to  the  point  being  evaluated  will  also  tend  to  receive  the  greatest 
weight. 


The  reader  should  also  refer  to  the  exchange  between  M.  G.  Kendall,  M. 
Quenouille,  and  M.  J.  Farrell  in  [18] . 

2 

See  the  discussion  of  our  DEA  approach  to  data  treatment  in  [30]. 


m 


5.  Slack  and  Alternate  Optima 


The  preceding  remarks  need  some  amplification  which  we  shall  try 
to  supply  in  a way  that  also  clarifies  condition  ii  in  (13).  For  this  we 
now  turn  to 

5 

p6  -'Li.- 

As  depicted  in  Figure  1,  this  point  lies  to  the  right  of  the  efficiency 

frontier  which  terminates  at  P^.  Via  the  developments  provided  in  (10.1) 

ff.,  we  can,  of  course,  exhibit  the  nonefficiency  of  P,  within  the  tableau 

o 

for  (14.2).  We  proceed,  however,  to  a separate  treatment  of  P,  to  show, 

D 

inter  alia,  how  some  of  the  additional  information  available  in  these 
extended  tableaus  may  be  utilized. 

First  we  observe  that  the  inverse  of  any  basis  is  found  under 
the  slack  vectors  at  any  stage. ^ Thus,  continuing  with  (14.2)  we  imme- 
diately have 

»'1  - <v  v1  - [1/6  ii] 

from  Stage  5*  in  Table  1.  Hence,  we  also  have 


for  insertion  in  the  "F0"  column  at  this  stage.  As  was  the  case  at  Stage  4, 

one  of  the  components  is  negative,  but  we  are,  nevertheless,  in  a position 

2 

to  continue  with  the  dual  method,  just  as  before. 

^See  [9]  and  [16]  for  further  discussion. 

2 

Since  P^,  the  new  vector  to  be  considered,  has  Pj_  Pf,  the  usual  condi- 
tion for  continuation  with  the  dual  method  is  automatically  satisfied. 

I.e.,  one  has  Zj  - c^  >_  0,  all  j,  including  j=k. 
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One  iteration  with  the  dual  method  produces 

i 


P3  X*  + e^^  s* 


P - P, 
o 6 


with 

X*  ■ 1 and  s*  ■ 1 

and  all  other  variables  at  zero  value.  Evidently  the  zero  slack  requirement 

in  ii  of  (13)  is  violated  and  so  P&  is  not  efficient.  The  interpretation  is  also 
relatively  straightforward  - viz. t a reduction  of  = 5 to  ■ 4 would 
eliminate  this  positive  slack  and  bring  P^  into  coincidence  with  P^,  in  which 
case  X*  = z*  = 1 and  both  conditions  in  (13)  being  satisfied  efficiency 
would  again  be  achieved. 

Now  we  observe  that  an  alternate  optimum  is  present  under  (14.1) 

I 

since  also  z*  - 1 with  X*  = 1 and  s*  = 0.  However,  as  we  have  just  seen, 
o 6 1 

Pg  *8  not  efficient  relative  to  the  basis  B * (P^,  e^)  since  evidently 

a reduction  in  x^  is  possible  relative  to  this  basis. 

Thus  it  is  necessary  to  insure  that  no  such  slack  is  present 

in  any  alternate  optimum.  Among  these  alternate  optima  one  would  then 

m 

select  the  one  with  maximal  Z s*  which  is  equivalent  to  working  with 

i=l  1 

(11.2)  rather  than  (14.2).  In  the  present  case  this  would  give  z*  * 

1 + ^ > 1.  In  other  words  when  we  employ  (11.2)  rather  than  (14.2)  we 

need  utilize  only  condition  i in  (13).  On  the  other  hand,  we  can 

continue  to  work  with  (14.1)  provided  we  are  also  willing  to  apply  ii  of 

(13)  in  the  indicated  manner  since  the  only  way  that  this  ambiguity  can 

1 

occur  is  via  the  presence  of  positive  slack  in  some  alternate  optimum. 


*This  ambiguity,  too,  may  be  repaired  in  a more  formal  manner.  We  shall  not 
pause  to  do  this  here,  however,  since  in  Section  8,  below,  we  shall  provide 
a more  symmetric  formulation  which  will  reduce  only  to  condition  i of  (13)  so 
that  condition  ii  of  (13)  may  then  be  Ignored. 
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6.  Duality  and  Other  Tableau  Characterizations 

We  should  stress  the  importance  of  employing  an  adjacent  extreme 
point  method  in  the  calculations  and  characterization  of  extreme  efficient 
points.  In  particular,  as  will  be  seen,  the  tableaus  resulting  from 
simplex  and  dual  method  calculations  - which  are  both  adjacent  extreme 
point  methods  - provide  a wealth  of  information  concerning  such  things  as 
efficient  frontier  facets  and  the  normal  vectors  to  them. 


- 31  - 


In  order  to  reap  out  the  efficient  isoquant  we  proceed  by  determin- 
ing the  extreme  points  spanning  one  facet  plus  the  normal  to  this  facet. 

We  then  move  on  through  adjacent  efficient  facets  and  continue  doing  so 
until  all  of  the  wanted  efficient  facets  and  normal  directions  have  been 
determined.  The  latter,  which  correspond  to  an  optimal  solution 
to  the  dual  problem  associated  with  the  primal  problem  formed  from  (14.1) 
or  (14.2),  provides  us  with  estimates  of  the  isoquant  slopes,  as  we  shall  show. 

Our  procedure  allows  us  to  characterize  these  efficient  facets 
explicitly  and  in  ways  which  have  not  heretofore  been  available.  Hence, 
we  shall  develop  them  here  in  all  detail  and  generality  as  follows. 

We  shall  first  assume  that  all  of  the  observed  points  have 
positive  components.  Also,  if  m is  the  number  of  such  components,  then 
we  shall  also  assume  that  there  is  at  least  one  basis  for  m-dimensional 
space  among  the  vectors  P^.  We  can  then  start  by  choosing  Pq  as  some 
point  which  is  properly  interior  to  the  cone  with  apex  at  the  origin  and 
spanned  by  P^,  ...,  Pn«  This  can  always  be  arranged  since,  for  example, 
n 


P 

o 


will  have  this  property. 


An  illustrative  example  may  help  to  fix  ideas  and  provide  con- 
crete insight  for  our  procedures  and  interpretations.  Thus,  going  back 

to  the  data  of  Figure  1 (without  P,1  and  chocsing  P = P. initially,  as 

o o i 

before,  we  set  up  the  simplex  tableau  for  (14.2)  and  then  proceed  as  in 


Table  1. 
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Notice  Chat  for  (14.2)  we  always  have  a basis  of  natural  slack 
unit  vectors  as  in  Stage  0.  Notice  further  that  the  tableau  entries  under 
the  structural  vectors  P^,  ....  and  the  slack  vectors  e^,  arc  com- 

pletely independent  of  the  entries  under  the  stipulations  vector  P - i.e., 

o 

the  entries  under  the  structural  and  slack  vectors  in  each  tableau  are 

the  same  no  matter  what  P is  selected. 

o 

In  particular,  consider  Stage  2 in  Table  1.  As  is  evident  from 

Figure  1,  both  P^  and  P^  are  efficient  extreme  points.  Nevertheless, 

the  facet  spanned  by  them  is  not  efficient.  Correspondingly,  the  Stage  2 

tableau  is  not  an  optimal  tableau  since  z^  - < 0.  But  Stage  3* 

is  an  optimal  tableau  even  though  z*  = 7/6  > 1 tells  us  that  P^  is  not 

efficient.  Now  replacing  P^  by  Pq  = P^v^  + P^Vj  f°r  any  V^,,  V,.  0 and 

■ 1 will  not  change  the  z^  - c.  but  will  give  us  a Pq  column  of 

V4,  with  z*  * 1 so  the  whole  facet  spanned  by  P^,  P^  is  efficient. 

. The  lesson  here  is  that  we  can  start  with  P an  interior  point 

o 

and  proceed  through  simplex  tableaus  to  an  optimal  tableau.  Then  replacing 

P^  by  any  convex  combination  of  the  optimal  basis  vectors  ve  will  achieve 

a z*  = 1.  Hence  we  will  have  exhibited  an  efficient  facet  and,  as  we  shall 
o 

show,  the  equation  of  the  hyperplane  containing  this  facet  will  also  then 
be  available.  For,  as  is  well  known,—' ^ the  z^  - c^  under  the  slack  vectors 
in  an  optimal  tableau  are  an  optimal  solution  to  the  dual  problem.  I.e., 
these  optimal  z^  - c^  values  under  the  slack  columns  in  the  primal  tableau 
are  the  optimal  values  w = ui*  for  the  dual  problem, 

Min  g = u)TP 
o o 


(15) 


with  w P > 1,  j * 1, 


T 

U)  > 0 


— ^See,  e.g.,  [91  or  [16]. 
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and  <u  Pj  ! 1 for  each  in  an  optimal  basis.  The  superscript  T represents 
transposition,  as  usual,  so  that,  e.g. , represents  the  transpose  of  the  column 
vector  (Vwith  components  uuj, ....  u>nl  and  UJ*  denotes  an  optimum  value  for  these 
variables  in  the  above  problem. 

To  6how  that  is  orthogonal  to  the  efficient  facet  spanned  by  these  Pj, 
we  need  only  show  uj*  is  orthogonal  to  any  direction  lying  in  the  facet,  e.g, , to 
any  vector  which  is  the  difference,  P - P,  of  two  vectors  in  the  facet.  Since  P 
and  P are  in  the  facet. 


P ■ I P,  v. , P - E P v 
i x i 1 1 


0 < V1#  V <1 


(16.1)  and 


E V.  0 E V.  =1 
i i 


where  summation  is  over  the  Indexes  Indicated  by  these  P^.  But  then 


w*  (P  - P)  - E (u*T  Pj.  v±  - o*T  P±  V±) 


(16.2) 


- E (IV  - lVJ.iv,  - E V = 0, 

i l 1 i 1 i 


since  E V - E v **  1.  Q.E.D.  Hence,  the  u*  corresponding  to  this 
i 1 i 1 

efficient  tacet  (and  simplex)  determined  by  this  optimal  basis  is  orthogonal 
(or  normal)  to  it.  Thus,  w*  is  normal  to  the  hyperplane  containing  this 


facet.  The  equation  of  this  hyper plane  is 


(17) 


x 


1 


— J*« 


vhcrc  x is  any  point  in  the  linear  space  spanned  by  the  totality  of  the 

P. *8  and  e 's. 

J X • 

We  now  return  to  the  two-dimensional  example  of  Figure  1 in  order 

to  exemplify  these  developments.  Taking  Stage  3*  in  Table  1 for  illustration, 

T 

we  have  w*  « (1,  JL)  as  is  apparent  from  the  z-  - c values  listed  under 
3 6 J J . . . 

the  slack  vectors  at  this  stage.  The  points  of  the  efficient  facet  are 

PQ  “ v4  + v5  *or  a*1  V4»  V5  — ® with  v4  + >*5  “ Thereby  we 

have  u*TP  ■ (u^P.)  v + (w*Tp  ) v_. 

o h 4 a 5 * 


Moving  to  (17)  we  therefore  have,  for  this  two-dimensional 


w*  x ■ Oj*  x^  + u>2*  x2  ® 1 

which,  via  the  results  available  at  Stage  3*  in  Table  1,  become 

1 x,  + .1  x-  --:\i. 

3 A 6 

In  Figure  1,  this  facet  is  on  a line  with  the  segment  connecting 
P^  and  P^  constituting  part  of  the  efficient  isoquant  and,  in  fact, 
this  segment  may  now  be  rendered  explicitly  as  the  set  of  points  repre- 


sented in 


(Xj,  x2)  : 1 x1  + 1 x2  ■ 1;  1 < x1  < 2,  2 <_  x2  < A 


Continuing  to  Stage  5*  and  observing  the  z^  — values  under  the  slack 
at  this  stage  we  then  obtain  the  expression  for  the  adjacent  Isoquant 


segment  as 


f3'4]  = 


» *0^  : 1,  x,  + 1 x, 
6 1 3 1 


1;  2 <xx  < «.  1 <x2<  2j 


i 
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Of  course,  the  property  to  be  emphasized  is  that  the  numerical 
values  of  the  coefficients  in  these  expressions  are  available  without 
any  extra  effort.^  Hence,  also  without  extra  effort  we  have  a new  method 
of  effecting  numerical  estimates  of  extremal  relations  from  observational 
data.  This  should  be  of  interest  for  economics,  or  any  other  discipline 
where  explicit  numerical  estimates  of  such  relations  are  wanted  — e.g., 
under  the  kind  of  conditions  remarked  upon  in  our  introduction. 


7.  Production  Functions  and  Cost  Relations 

We  now  leave  aside  other  issues  like  invariance  of  our  efficiency 

2 

measures  in  order  to  concentrate  on  the  production  functions  associated 
with  these  coefficient  estimates.  These  functions  are  evidently  not  of 
the  variety  which  have  customarily  been  used  in  empirical  studies.  They 
are  not  of  the  aggregate  variety  which  generally  assume  that  (i)  all  firms 


^The  isoquant  segment  extending  from  to  may  be  represented  by 

S[3,  1')  S {(x^  x2)  : 1 = 0 x1  + x^;  4 < x^  1 <_  x2) 
or  by 

S[3,  1')  = ((x^,  x2)  : 1 = ^ X1  + x2;  4 £ s^,  1 £ x2> 


where  the  prime  refers  to  the  subscript  on  Q and  the  square  bracket  denotes 

inclusion  of  the  point  whose  subscript  is  next  to  the  bracket,  while  the 

round  bracket  denotes  exclusion.  The  expressions  refer  to  the  segments  from 

P,  to  Q1  in  Figure  1,  according  to  whether  (11.])  or  (14.2)  is  employed  - 

and  a similar  development  may  obviously  be  undertaken  for  the  segment 

extending  from  P,.  to  Q . 

2 5 1 
See  Appendix. 
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l 

, 


h 


! 


have  the  same  production  function  and  (ii)  are  on  their  efficiency  frontiers 

as  a necessary  condition  for  obtaining  access  to  the  relevant  micro  theorems. 

2 

Neither  of  these  assumptions  is  used  here  but,  of  course,  we  proceed  on 

3 

the  assumption  that  relative,  rather  than  theoretically  attainable, 

efficiency  suffices  for  access  to  the  theorems  and  procedures  of  micro 

4 

analysis.  Indeed  it  is  one  purpose  of  the  preceding  development  to  distin- 
guish between  inefficient  and  efficient  firms  in  such  a way  that  only  the 
latter  enter  into  the  coefficient  estimates  for  these  production  functions. 

Evidently  these  production  functions  are  also  not  of  the  variety 
that  have  been  used  in  empirical  studies  at  the  level  of  individual  firms, 
since  all  relevant  firms,  or  at  least  all  relevant  efficient  firms,  are  included 
in  the  observations  from  which  these  coefficient  estimates  are  obtained. 

In  some  ways  these  production  functions  are  reminiscent  of 
Alfred  Marshall's  concept  of  a "representative  firm."**  Here,  however,  the 


referent  is  rather  to  "representative  efficient  firms."  Note  that  the 


I.e.,  identically  the  same,  apart  from  a multiplicative  constant.  We  are 
referring  to  the  case  which  Sato  [33],  pp.  3-8, refers  to  as  the  Solow-Fisher 
(exact)  aggregation  problem.  These  conditions  may  be  relaxed  under  a variety 
of  assumptions.  For  instance,  if  one  is  willing  to  assume  that  all  firms  have 
extended  Cobb-Douglas  functions  in  the  sense  of  [13],  but  not  necessarily  the 
same  degree  of  homogeneity,  then  the  aggregate  function  is  also  Cobb-Douglas  in 
this  same  extended  sense.  See  [14].  Condition  (ii)  must  be  satisfied  in  any  case, 
since  otherwise  the  output  for  given  inputs  will  not  be  maximal. 

2 

Cf.,  the  discussion  in  Farrell  [18]. 

^ Again  vide  Farrell  [18]  who  argues  that  this  theoretically  attainable  efficiency 
lacks  operational  significance  for  practical  applications.  On  the  other 
hand,  we  ought  to  note  that  new  constraints  may  be  adjoined  i the  above 
programming  models  whenever  such  a_  priori  theoretical  boundaries  are  known 
to  be  applicable  to  particular  firms. 

We  are  speaking  generally  since  assumptions  like  continuity  in  the  deriva- 
tives will  evidently  need  to  be  modified. 

^Cf.,  Johnston  [24]. 

**See  the  discussion  in  Stigler  [38], 


v 


- 37  - 


plural  is  required  insofar  as  there  is  more  than  a single  facet.  The 

continuum  within  each  facet  is  then  representative  of  the  efficiency  for 

which  the  originally  observed  efficient  firms  serve  as  referents. 

Note,  in  particular,  that  we  cannot  generally  average  across 

facets  to  obtain  a reduced  set  (e.g.,  a single  set)  of  representative 

coefficients^"  without  losing  the  property  of  efficiency  associated  with 

2 

the  original  estimates.  In  certain  circumstances,  however,  we  might  want 
to  regard  this  as  being  the  production  function  associated  with  one  "super- 
firm."  This  might  be  useful  when,  for  instance,  all  of  the  underlying 
entities  are  participants  in  a single  government  program  with  an  administrator 
who  would  like  to  obtain  estimates  of  the  costs  which  are  to  be  expected  with 
efficient  operations. 

Normally  such  costs  would  be  obtained  from  actual  or  projected  market 

data  such  as  forecasts  of  teacher  salaries  in  particular  school  districts  or  costs 

of  supplies  and  so  on.  Here,  however,  we  want  to  relate  our  developments  to 

3 

another  type  of  duality  wherein  C(y,p)  is  a cost  function  to  be  determined  via 

T 

C(y,  p)  = min  p x for  xeL(y) 


where 


L(y)  = {x  : at  least  the  output  vector  y is  produced}. 


^General  guidelines  for  aggregation  and  disaggregation  will  be  supplied 
in  the  next  section  for  use  when  various  kinds  of  cross-comparisons  are 
wanted. 

2 

This  is  also  true  for  the  interpretations  of  results  derived  from  cross  section 
data  as  in  the  interpretations  of  Bronf enbrenner  and  Douglas  discussed  by  Sato 
[33]  in  Chapter  8 and  hence  is  also  a deficiency  of  the  envelope  and  interfirm- 
intra  firm  production  function  interpretations  of  Bronf enbrenner  and  Reder 
discussed  by  Sato  in  this  same  Chapter. 

2 

We  have  elsewhere  suggested  that  this  might  better  be  associated  with  the 
branches  of  mathematics  referred  to  as  "transform  theory."  See  [15]. 
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In  other  words  L(y)  is  the  point  to  set  mapping  y— *L(y).  For  instance, 
in  Figure  1,  it  is  obtained  via  the  isoquant  associated  with  y ■ 1 so  that 
in  this  case  L(y)  designates  all  of  the  points  x = (x^,  x associated  with 
input  combinations  on  or  to  the  northeast  of  this  isoquant.  In  any  event, 
from  a knowledge  of  these  relations,  the  cost  function  C(y,  p)  is  then  to  be 
obtained  via  the  indicated  minimization  where  p is  a price  vector  with 
component  p^  representing  the  "price"  per  unit  x^  the  amount  of  the  ith 
factor  input. 

The  idea  is  to  relate  the  above  developments,  including  Farrell 

Efficiency,  to  another  strand  of  theoretical  work  in  economics*  emanating 

2 

from  the  original  formulations  by  Samuelson  [32]  and  Shephard  [36].  The 
latter  work,  like  other  parts  of  micro-theory,  has  been  developed  with 
respect  to  decision  making  at  the  individual  (firm)  level.  Thus,  following 
Shephard  [37]  we  may  formulate  the  problem  of  obtaining  a cost  function 
from  an  already  known  production  relation  by  parametrically  varying  p for 
each  y to  produce  the  portion  of  the  production  function  associated  with  the 
L(y)  for  each  y.  Then  parametrically  varying  y produces  the  entire  cost 
function  C(y,  p)  from  the  transformation  indicated  in  the  expressions 
previously  displayed. 


See  [22.1],  [22.2]  and  [22.3]  for  detailed  and  extensive  discussions. 

^We  have  elsewhere  shown  how  Farrell's  efficiency  measure  can  be  explicitly 
related  to  Shephard's  gauge  function.  See  [15]. 
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In  our  case  we  want  our  production  function  to  be  empirically 
based.  That  is,  we  want  our  production  function  to  be  based  on  observed 
input  output  values  and  estimates  derived  from  them  such  that  no  firm  from 
the  observation  set  has  a larger  output  for  any  Inputs  that  may  be 
specified.  Also  no  non-negative  combination  of  these  firms  can  have  a 
larger  output  when  extrapolations  or  interpolations  from  the  original 
observations  must  be  undertaken  for  the  specified  input  values. 

With  this  definition  we  can  then  proceed  to  obtain  the  wanted 


cost  function  as  follows.  Let  y be  some  prescribed  output  (a  scalar)  and 


th 


let  a , the  8 row  of  the  matrix  A represent  the  set  of  coefficients 
s 


th 


associated  with  s efficient  facet  estimated  from  the  data  as  described 
in  the  preceding  section.  Let  P be  a matrix  with  its  row  vectors 


Pj  representing  the  observational  data  for  each  of  the  original  j **  1, 
1 


n firms.  Then  on  this  definition  of  optimality  (i.e.,  relative 

efficiency  of  the  production  frontiers)  we  can  obtain  the  wanted  cost 


function  from  the  following  formulation, 

^ T 
min  p x 

with 


(18.1) 


A x > I y 
-PA  +'I  x = 0 
A > 0, 


in  which  I is  the  identity  matrix  so  that  (1)  1^  = PA,  A > 0 assures  us  that  we  will 


be  deriving  our  production  function  from  empirically  based  observations,  and  (2) 


HgX  > y together  with  the  minimizing  objective  assures  us  that  we  will 


always  be  on  an  efficient  frontier. 


One  can  omit  the  Pj  which  are  not  efficient  or  else  one  can  adjust  and  bring 


them  into  the  efficient  set  in  the  manner  to  be  indicated  in  the  next  section. 


J 
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The  (mathematical  programming)  dual  to  (18.1)  is 
T 

max  yn  e 

T T T 
with  n A + u I ■ p 

(18.2)  - uTP  < 0 

nT  > ° , 

where  e is  a column  vector  with  unity  for  all  its  elements.  Via  the 
duality  theorem  of  mathematical  programming,  we  then  have 

(19.1)  pTx  > ynte 

for  all  x,  X and  n,  u which  satisfy  the  constraints  and 

(19.2)  pTx*  = yn*Te, 

at  an  optimum.  In  other  words, 

(19.3)  C(y,  p)  = yn*Te  = pTx* 

is  the  required  (minimizing)  cost  function,  which  varies  with  each  choice 
of  y and  p. 

Here  we  have  proceeded  from  the  production  function  to  the  cost 
function  but,  of  course,  we  could  also  have  proceeded  via  the  opposite 
course.  The  latter  is  an  "in  principle"  statement  only,  however,  since,  as 
noted  in  the  introduction,  many  of  the  inputs  and  outputs  in  public  sector 
applications  are  not  easily  priced  or  costed  without  recourse  to  arbitrary 
and  ex  cathedra  procedures  and  assumptions.  Thus  we  shall  prefer  to  continue 
from  the  production  rather  than  the  cost  side  after  observing  that  additional 
constraints  may  need  to  be  adjoined  to  (18.1)  or  (18.2)  to  meet  a variety  of 
legal  or  institutional  requirements.^ 

^The  adjunction  of  such  constraints  can  cause  trouble  for  the  Shephard-Samuelson 
transforms,  however.  See  [15]. 
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8.  Efficiency  and  Multiple  Output  Production 

Having  exhibited  how  our  production  function  and  its  estimation 
differ  from  others  which  have  been  customarily  employed,  we  also  ought  to 
examine  ways  in  which  we  might  extend  as  well  as  utilize  received  constructs 
from  micro-economics.  That  is,  we  ought  to  be  able  to  use  this  new  kind  of 
production  function  to  obtain  new  results  even  while  attending  to  the  need 
for  maintaining  contact  (as  in  the  preceding  section)  with  already  available 
theorems  and  results  from  micro-economics. 

For  this  purpose  we  now  turn  to  a new  extension  of  the  customary 
definition  of  efficiency  as  the  ratio  of  output  to  input  for  the  single 
input-output  case  (usually  with  inputs  and  outputs  measured  in  the  same  units) 
to  the  case  of  multiple  inputs  and  outputs.  We  shall  do  this,  moreover,  in  a 
way  that  leads  to  a generalization  of  the  preceding  developments  into  the 
case  of  multiple  output  firms  in  a very  natural  way.  This  multiple  output 
case,  it  may  be  recalled,  is  likely  to  be  prominent  in  the  public  policy 
(governmental  program)  areas  which  are  of  interest  to  us.  We  also  want  to  do 
this  in  a way  that  releases  us  from  the  need  for  relying  on  data  such  as 
market  prices  which  are  of  only  limited  value  (and  availability)  for  many  of 
these  applications. 

To  achieve  these  objectives  we  note  first  that  if  we  were  to  weight 
each  firm's  inputs  and  sum,  we  could  get  a scalar  measure  that  we  might  refer 
to  as  a "virtual  input."  Similarly,  we  could  weight  the  firm's  outputs  and 
sum,  to  get  a "virtual  output."  Then  we  could  take  the  resulting  ratio  to  be 


See,  e.g. 


Van  Nostrand's  Scientific  Encyclopedia  [AO]. 
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a "virtual  efficiency"  for  this  firm,  provided  its  value  was  between 
zero  and  one.  Clearly  there  are  an  Infinite  number  of  possible  choices 
of  such  weights  and  their  consequent  measures  of  efficiency. 

To  free  our  definition  of  efficiency  from  arbitrary  choices  of 
these  weights  and  to  guarantee  each  firm  the  highest  efficiency  rating  it 
can  receive  from  observational  data,  while  rating  all  firms  with  efficiency 
between  zero  and  one  using  these  same  weights,  we  define  efficiency  for  a 
designated  firm  as  the  maximum  virtual  efficiency  obtainable  with  non- 
negative weights  that  impute  an  efficiency  between  zero  and  one  to  every 
firm  in  the  comparison  set.  In  formulas,  we  have 


max  h *» 


Z u y 
, r'ro 
r=l 


o m 


I v.x, 
i-i  1 10 


(20) 


subject  to 


E u y 


r*=l 


r'rj 


m 

I viXlj 


i-1 


i 1,  j-1. 


n 


where  ur,  v^  > 0 are  weights  applied  simultaneously  to  the  j * 1 n 

firms  observed  to  utilize  common  inputs  i * 1,  . . . , m in  amounts  x^  and 
produce  common  outputs  in  amounts  y^j,  r*l,  •••»  s*  The  subscript  zero  is 
reserved  for  the  designated  firm. 

To  recapitulate,  each  firm  will  have  its  efficiency  rated  relative 


to  every  other  firm  producing  the  same  outputs  from  the  same  kinds  of 
Inputs.  This  is  to  say  (again)  that  the  efficiency  rating  will 
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be  relative  to  the  comparison  set.  With  this  understanding  the 
resulting  rating  is  secured  from  the  indicated  ratio  by  means  of  an 
optimal  (maximizing)  choice  of  non-negative  weights  in  such  a way  that 
no  firm  is  rated  more  than  100Z  efficient. 

In  our  case  all  input  and  output  observations  are  positive  and 
so  it  is  not  necessary  to  deal  with  indeterminate  or  nonsolution  possibilities. 
In  any  case  the  above  formulation  is  an  extended  (nonlinear)  formulation 
of  an  ordinary  fractional  programming  problem  for  which  we  have  elsewhere 
supplied  a complete  theory.*  This  means  that  we  can  deal  with  such 
possibilities  in  already  known  ways  and  hence  need  not  turn  aside  to  deal 
with  issues  such  as  possible  indeterrainacies  in  any  case. 


*See  [10]  and  [7]. 
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Thus  ve  nov  proceed  Co  relate  the  above  measure  to  other  parts 
of  our  analysis  by  utilizing  obvious  manipulations  to  replace  it  with  the 
following  formulation  for  an  "inefficiency  measure": 


,E,  Yi« 

min  f - 111 

° s 

E u y 
, r'ro 
r-1 


subject  to 


E v x 
i-1  3 


Z u v 


^ 1 » J*1 » • • • » n 


V ur  - °‘ 

Now  we  propose  to  replace  this  formulation  with  an 
ordinary  linear  programming  problem  as  follows.  First  consider 


subject  to 


- E y.X.  +y  z < 0,  r*l,  ...»  s 
j-1  fj  j ro  o - ’ 


jfi  V* 


1 xio*  iml ■ 


Xj  - °»  J*1 n 


Because  (22)  is  an  ordinary  linear  programming  problem  it  has  a linear 
programing  dual  which  we  can  write  as  follows: 


r 
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Min  g = E u.x. 
6o  . , i io 
i=*l 


subject  to 


- s Vh  + 1 uixii  i 0 

r-l  r rj  i-1  1 13 


E y y 
, r'ro 
r=l 


1 0 • 


Because  of  the  condition  E p y * 1 in  (23)  one  can  recognize 

r-l 

that  it  is  equivalent  to  an  ordinary  linear  fractional  programming  problem. 
In  fact,  utilizing  the  theory  of  linear  fractional  programming  with  the 
transformation 


- tv^,  i-1,  m 


Wr  = tur,  r-l,  ....  s. 


which,  with  t > 0,  gives  explicitly 


min  f 


Z v.x. 
i-i  1 10 


o s 


E u y 
i r ro 
r-l 


subject  to 


Z v x - E u y > 0,  j-1,  ...,  n 

i-1  J r-l  J 


V ur  — 0 ’ 


as  the  equivalent  of  (23).  By  very  evident  manipulations,  however,  we 
can  see  that  (25)  is  the  same  as  (21).  Q.E.D. 
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We  are  now  in  an  advantageous  position  from  several  standpoints. 

We  have  a completely  symmetric  definition  of  efficiency  which 
generalizes  single  output  ratio  definitions 

not  only  In  economics  but  in  engineering  and  other  natural  sciences.^  We  do 
not  need  to  solve  the  nonlinear  (and  nonconvex)  problems  in  which  these 
definitions  are  formalized.  We  need  only  solve  the  ordinary  linear  program- 
ming problem  (23)  in  order  to  obtain  both  the  optimal  f or  h and  the 

o o 

* * 

weights  v^,  ur  _>  0. 


This  can  be  seen  by  observing  that  multiplication  of  all  expres- 
sion in  (25)  by  any  t > 0 leaves  the  optimal  value  of  f unaffected.  Thus, 

* 

choosing  t * 1 we  have  at  once 

, „ * * * 

(26.»  - gc  - zo 


and  therefore 

(26.2)  h*  = 1/z * = TEF(P  ) 

o o o 

as  well  as  the  wanted  (relative)  weights. 

Thus  nothing  more  is  required  than  the  solution  of  (23)  or  (22) 

* * 
in  order  to  determine  whether  f > 1 or,  correspondingly,  whether  hQ  < 1, 

with  efficiency  prevailing  if  and  only  if 

(26.3)  f*  = h*  = 1. 

o o 

In  this  way  we  have  dispensed  with  condition  ii  in  (13),  as  promised. 

* * 

Hence,  our  use  of  the  single  characterization  for  TEF(P  ) = h * 1/z  is 

o o o 

justified  when  only  this  measure  efficiency  is  wanted. 


See,  e.g.. 


[40]. 
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For  production  function  and  other  analyses,  however,  we  may  need  to  retu 
to  condition  (ii)  in  (13)  for  various  adjustments.  To  indicate  some  of 
these  possibilities,  we  should  first  recognize  that  (22)  is  a generalization 
of  (14.1)  in  which  (7.1)  is  replaced  by  new  observational  vectors 


(27) 


wherein  the  subvector  contains  the  observed  output  values  y^ , r*l,  . . . , s 
for  its  components  and  the  subvector  contains  the  observed  input  values 

xij*  •••»  m* 

Before  proceeding  further  we  need  to  make  the  following  remarks. 

Because  of  the  presence  of  multiple  outputs  we  have  omitted  the  use  of 

norms  and  thus  record  these  y^  and  x^  values  directly.*'  I.e.,  we  do 

not  normalize  on  output  as  we  did  in  (7.1)  and  (7.2).  In  fact,  the  concept 

of  an  isoquant  becomes  ambiguous  under  such  multiple  output  analysis  and 

indeed  the  concept  of  a production  function  gives  way  to  more  general 

concepts  such  as  "production  possibility  sets"  and  the  "activities"  to 

2 

which  they  are  related.  Efficiency  and  related  concepts  continue  to 
maintain,  however,  as  do  our  methods  of  securing  estimates  of  the  activity 

3 

coefficients  from  empirical  data  via  the  mathematical  programming  dual  (22). 

Adjustments  are  needed  in  both  concept  and  method  for  these 
more  complex  multiple  output  situations,  of  course,  but  these  are  minor 


The  norming  occurs  in  a more  recondite  form  in  the  final  constraints  of  (23). 

2 

Cf.,  e.g..  Arrow  and  Hahn  [4],  p.  52  ff. 

3 

I.e.,  (23).  We  note  that  there  is  also  a duality  theory  for  fractional 
programming  which  remains  to  be  exploited  for  (20)  and  (21).  See 
Bector  [5],  Jaganathan  [23],  and  Schaible  [34]. 


■ 


- 48  - 


and  relatively  straightforward.  Hence,  we  will  not  deal  with  them  here  in 
order  to  enable  us  to  devote  our  remaining  attention  to  the  kinds  of  adjustments 
that  are  needed  when  we  want  to  ensure  that  the  resulting  (adjusted) 
observations  are  on  the  relevant  efficiency  frontiers. 

Consider  therefore  the  following  reformulation  of  (22): 


Max  z 


Z Y A + Y z <0 

j-l  J j o o - 


n 

Z X A 
j«l  J J 


*j  >.  o,  ■ j**l n . 

Let  its  optimal  solution  be  represented 

(29)  z*.  s*+,  s*~,  A*,  j-1,  ....  n, 

where  s*+  represents  a vector  of  non-negative  slack  associated  with  the 
output  additions  and  s*  represents  a vector  of  non-negative  slack  asso- 
ciates with  input  subtractions.  If  z > 1 or  if  any  component  of  s*+  or  s* 
is  positive  then  via  (13)  the  efficient  frontier  of  the  production  possi- 
bility surface  has  not  been  attained. 

If  the  observations  for  this  firm  are  to  be  utilized  in  further 
analysis  of  a micro-theoretic  variety  it  is  necessary  to  bring  them 

onto  the  surface  in  the  following  manner.  All  output  must  be  scaled  upward 
* 

by  the  multiple  zq  1.  Also  all  slack  must  be  reduced  to  zero  by  increasing 
the  outputs  or  decreasing  the  inputs  for  which  non-zero  slack  appeared  in  (23) . 
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To  make  these  statements  more  precise  and  to  show  that  a point 
on  the  efficiency  frontier  is  thereby  produced,  we  utilize  the  data  of  (29) 
and  replace  (28)  with  the  following  new  problem: 


Max  z 


- Z Y X + (Y  z + s*  )z  <0 
j = l J J ° 0 ° 


E XX 
j-1  3 3 


< X - s* 
— o 


Xj-°- 

We  shall  refer  to  (30)  as  the  "varied  problem"  and  show  that  the  thus 
adjusted  observations  satisfy  the  conditions  for  efficiency  - (13)  ff  - 
as  follows.  Evidently  we  can  secure  zq  » 1 since  zq  **  1 together  with 
(29)  gives  us  the  already  secured  optimal  solution  to  (28).  Now  suppose 
we  could  have  zq  > 1 in  (30).  This  would  yield 


nA‘*  *+* 

"E  Y.X  + Y z 7 < - E Y X,  + (Y  z + s*  ) z < 0 

jBl  j j o o o - .ml  j j 


‘ E XX 
j=l  3 3 


< X - s*  < X . 
— o — o’ 


since  s*  and  s*  are  both  non-negative.  Evidently,  the  expressions  on 

it  A* 

the  left  then  satisfy  the  "unvaried  problem"  (28)  with  z z in  place  of 

o o 

jfr  it 

z and  X.  in  place  of  X.  . However,  then  also 
o j J 


* a*  * 

Max  z > z z > z 
o—oo  o 

when  z*  > 1.  But  z*  « Max  z , by  hypothesis.  Thus  a contradiction  occurs 
o o o 

/v*  , 

which  can  be  resolved  only  by  assuming  z * 1 as  the  optimal  value  for  the 


varied  problem  (30). 
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Now  we  want  to  show  that  the  optimal  solution,  X^ , j-1,  ....  n 
to  the  unvaried  problem  (28)  is  an  optimal  solution  to  the  varied  problem 
(30)  with  zero  slack,  i.e.,  the  vectors  s*+  and  s*  have  zeros  in  all  com- 
ponents as  required  for  efficiency.  First,  we  show  via  (29)  and  (30)  that 

^ ic  ic  4. 

- E Y.X,  + Y z + s*  - 0 
j-'l  J j °° 


n * 
E XX 

j-1  3 3 


X - s* 
o 


is  a feasible  solution  of  the  varied  problem  with  z **  1.  That  is 


® ^ a + 

- E Y.X,  + (Y  z + s*  ) z = 0 

H11  " 


^ if 

E X,X. 
j-1  j J 


X - s* 
o 


with  z = 1.  However,  with  z - 1 the  X,  from  the  original  unvaried 
o 0 j 

problem  continue  to  satisfy  the  constraints  and  are  optimal  for  the  varied 

"k  4.  — 

problem,  too,  after  the  zq,  s*  and  s*  adjustments  to  efficiency  are 
effected.  Q.E.D.  In  short,  the  indicated  adjustments  do,  in  fact,  always 


bring  the  original  observations  to  the  relevant  efficiency  frontier.  No 

* + 

new  computations  are  required  after  the  zq,  s*  and  s*  adjustments  are 
effected  for  the  original  data  for  the  efficiency  comparisons  we 

may  subsequently  want  to  make  since  all  observations  are  on  the  efficiency 


surface  as  required. 


9.  Concluding  Remarks 


The  adjustment  procedures  we  have  just  described  can  be  put  to 
use  in  a variety  of  ways.  They  may  be  used,  for  instance,  to  distinguish 
between  managerial  decisions  and  other  sources  of  efficiency  as  in  the  Data 
Envelopment  Analyses  which  were  discussed  in  our  introduction-  See  [30]. 

They  can  be  used  in  other  ways  as  well.  For  instance  Farrell’s 
technical  efficiency  concept  was  employed  by  Carlsson  [6]  in  an  attempt  to 
measure  the  (relative)  efficiency  of  Swedish  firms  by  reference  to  Leiben- 
stein's  concept  of  "X-eff iciency . The  developments  set  forth  in  our 
paper  should  be  helpful  in  meeting  criticisms  of  such  work  as  was  leveled  at 
Carlsson  by  FfJrsund  and  Hjalmarson  in  [21].  This  is  by  virtue  of  the  fact  that  oi 
formulations  can  handle  large  numbers  of  constraints  and  almost  limitless 
numbers  of  observational  vectors.  Thus  in  this  way  we  can  go  a consider- 
able distance  in  admitting  different  capital  vintages,  say,  or  different 

classes  of  labor  for  explicit  treatment  as  possible  sources  of  variations 
2 

in  efficiency. 

Data  availability  at  the  level  of  individual  decision-making  units 
may  cause  problems,  although  this  is  likely  to  be  more  serious  in  deal- 
ing with  private  enterprise  statistics  than  the  kind  of  public  programs  which 
form  the  main  concern  of  the  present  paper.  For  cases  in  which  aggrega- 
tions or  disaggregations  are  wanted,  however,  as  for  the  synthesis  of  a variety  of 
different  production  functions  from  the  same  underlying  data,  we  can  pro- 
vide the  following  general  guidelines. 

*See  [27]  and  [38].  Although  this  view  has  recently  come 
under  severe  criticisms  by  Stigler  [38],  no  empirical  evidence  was  sub- 
mitted in  Stigler's  argument  to  refute  Leibenstein's  claim  of  the  existence 
of  such  inefficiency. 

2 

Actually  it  was  Farrell's  criticism  of  Colin  Clark's  omission  of  some  factors 
of  production  (to  account  for  some  of  Clark's  results)  that  appeared  to  • 
motivate  Farrell's  original  work  in  [18], 
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In  synthesizing  from  among  the  many  possibilities  for  defining 
such  aggregate  (or  subaggregate)  functions,  properties  like  the  following 
need  to  be  kept  in  mind.  A "representative"  efficient  surface  (e.g.,  the 
unit  isoquant  in  the  single  output  case)  consists  of  facets,  each  of  which 
is  a simplex.  Each  such  simplex  spans  a convex  cone.  Thus  when  consider- 
ing the  aggregation,  i.e.,  the  summing,  of  two  or  more  inputs,  the  related 
efficient  aggregate  output  will  be  the  sum  (or  aggregate)  of  the  efficient 
outputs  of  the  individual  firms  as  long  as  they  are  in  the  same  cone.  If 
the  individual  inputs  are  from  different  cones,  the  aggregate  input  may  lie 
in  any  one  of  the  individual  cones  and  it  can  lie  in  other  cones  as  well. 

In  this  case  (i.e.,  at  least  two  inputs  from  different  cones),  the  effi- 
cient aggregate  output  will  exceed  the  sum  of  the  efficient  individual 
outputs  by  virtue  of  the  strict  sub-additivity^  of  our  efficient  production 
function  for  input  vectors  from  different  cones. 

Other  relations  of  disaggregation  and  aggregation  may  also  be 
employed,  of  course,  according  to  purposes  that  might  be  served  thereby. 
Inter  alia,  none  of  the  above  is  intended  to  imply  that  customary  approaches 
to  the  study  of  empirical  production  relations  must  necessarily  be  aban- 
doned. It  means  rather  that  we  now  have  numerous  other  alternatives 
that  can  be  shaped  and  applied  for  different  purposes  as  the  contexts  may 
suggest. 
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Some  of  the  advantages  and  possibilities  for  use  having  been  indi- 
cated at  various  points  in  the  text,  we  can  best  conclude  with  some  of  the 
possible  limitations.  In  addition  to  potential  lack  of  data  availability 
at  individual  firm  levels,  we  have  also  noted  that  our  approach  has  been 
mathematical  (i.e.,  deterministic)  rather  than  statistical  in  character. 

We  have  already  indicated  some  of  our  reasons  for  delaying  a treatment  of 
this  subject  since  we  propose  to  come  at  it  in  an  entirely  different  way 
via  the  techniques  that  we  refer  to  as  Data  Envelopment  Analysis.  For  the 
present,  however,  we  must  admit  that  our  treatment  of  these  statistical 
considerations  has  been  weak  since,  in  principle  at  least,  one  must  admit  to  the 
presence  of  statistical  error  even  when  all-inclusive  data  are  at  hand.  Of 
course  other  alternatives  to  Data  Envelopment  Analysis  mav  also  be  explored 
as  in  the  treatments  by  Timmer  [39]  among  others,  who,  following 
up  a suggestion  by  Aigner  and  Chu  [ 3], has  employed  a "chance  constrained 
programming"  formulation  — but  without  really  exploiting  anything  like  the 
full  range  of  formal  theory  that  is  available  from  that  quarter.* 

Finally  we  turn  to  the  measure  of  efficiency.  We  have  repeatedly 
noted  its  restriction  to  relative  efficiency.  For  the  kinds  of  public 
policy  applications  with  which  we  are  concerned,  this  provides  an  approach 
to  controlling  and  evaluating  managerial  behavior.  By  this  we  mean  that 
managers  who  fail  to  meet  the  indicated  efficiency  level  can  at  least  be 
confronted  with  the  evidence  and  required  to  justify  their  departure 
from  what  "the  evidence"  suggests  is  attainable.  In  this  manner  a basis 
for  further  learning  can  also  be  provided  since  any  special  or  ameliorating 
circumstances  revealed  in  the  course  of  such  inquiries  can  then  be 
introduced  as  constraints  into  future  models  and  measurement  results. 

*Cf.,  e.g.,  [8], 
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As  movement  between  industries  or  even  regions  becomes  a pertinent 
issue,  however,  the  case  for  this  measure  of  efficiency  begins  to  weaken. 

In  golfing  terminology  it  is,  so  to  speak,  a measure  of  "distance"  rather 
than  "direction"  with  respect  to  what  has  been  (and  might  be)  accomplished. 

As  such  it  is  inferior  to  "profit"  which  measures  direction  as  well  as 
distance  and  possibly  other  aspects  of  attainment,  including,  e.g.,  the  best 
among  all  available  alternatives,  whenever  it  is  really  applicable.  On  the 
other  hand,  most  public  endeavors  (schools,  police,  etc.)  are  not  free  to 
transfer  resources  simply  because  some  other  more  attractive  alternatives 
exist.  Hence  the  fact  that  their  resources  are  committed  in  an  indicated 
direction  suggests  that  such  measures  of  accomplishment  as  we  have  considered 
here  are  appropriate  and  they  are  certainly  needed  for  at  least  some 
purposes  of  policy  evaluation  and  prescription. 


APPENDIX 


In  all  of  our  treatments  in  this  paper  an  important  consideration  is  the 
convexity  of  the  set  of  production  possibilities.  There  may,  of  course,  be 
many  different  measures  for  the  amount  of  inputs  or  outputs  and  so  we  need 
to  consider  how  the  resulting  measure  of  efficiency  may  be  altered  as  these 
input  and  output  measures  are  altered. 

For  clarity  we  consider  the  single  output  case.  There  may  even 

be  transformations  of  the  n+1  coordinates  of  inputs  plus  outputs  into  other 

coordinate  systems  (with  n+1  coordinates)  in  which,  say,  a nonconvex  set 

of  production  possibilities  is  transformed  into  a convex  set.  See  section  1 

of  the  paper.  Provided  that  such  a transformation  is  "bicontinuous, " ^ 

however,  boundary  points  remain  boundary  points  and  interior  points  remain 

interior  points.  Thus  when  there  are  two  different  coordinate  representations 

both  of  which  have  convex  sets  of  production  possibilities,  efficient  points 

remain  as  such  in  both  systems  and  the  same  is  true  for  nonefficient  points. 

Hence  the  characterization  of  any  point  as  efficient  or  nonefficient  will 

remain  invariant  under  any  such  transformation.  The  numerical  magnitude 

may,  of  course,  change.  Specific  examples  can  be  given,  but  it  is  important 

to  note  that  even  this  numerical  value  remains  invariant  under  any  "homo- 

thetic"  transformation.  The  latter,  which  refers  to  constant  changes  of 

2 

scale,  is  the  one  that  will  usually  be  of  interest  and  hence,  in  this 
respect,  too,  we  can  build  on  the  important  opening  provided  by  Farrell 
[18]  for  empirical  work  and  related  policy  applications.  In  fact,  as  we 
elsewhere  show  [30]  new  techniques  of  data  analysis  can  then  be  developed 
for  added  use  in  cases  where  more  customary  statistical  and  econometric 
techniques  may  otherwise  fail  to  provide  what  is  wanted. 


^See,  e.g.,  J.  L.  Kelley  [25]. 

2 

See  the  discussion  in  Shephard  [37]. 
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